Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 271116 |
| Missing cells | 363853 |
| Missing cells (%) | 8.9% |
| Duplicate rows | 612 |
| Duplicate rows (%) | 0.2% |
| Total size in memory | 31.0 MiB |
| Average record size in memory | 120.0 B |
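The percentages in this table follow from the counts beside them. A minimal sketch of that arithmetic, assuming the report rounds to one decimal place; the helper name `percent` is illustrative and not part of ydata-profiling:

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


n_variables = 15
n_observations = 271116

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

missing_cells_pct = percent(363853, total_cells)
duplicate_rows_pct = percent(612, n_observations)

print(f"Missing cells: {missing_cells_pct}%")
print(f"Duplicate rows: {duplicate_rows_pct}%")
```

Both results agree with the reported 8.9% and 0.2%.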
Variable types
| Numeric | 5 |
|---|---|
| Text | 6 |
| Categorical | 4 |
| Dataset has 612 (0.2%) duplicate rows | Duplicates |
|---|---|
| Height is highly overall correlated with Weight | High correlation |
| Weight is highly overall correlated with Height and 1 other field | High correlation |
| Year is highly overall correlated with City | High correlation |
| Sex is highly overall correlated with Weight | High correlation |
| Season is highly overall correlated with City | High correlation |
| City is highly overall correlated with Year and 1 other field | High correlation |
| Age has 9474 (3.5%) missing values | Missing |
| Height has 60171 (22.2%) missing values | Missing |
| Weight has 62875 (23.2%) missing values | Missing |
| Medal has 231333 (85.3%) missing values | Missing |
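The four missing-value alerts can be checked against the dataset statistics. The sketch below assumes one-decimal rounding and uses illustrative variable names. It recomputes each percentage and sums the counts, which should match the reported 363853 missing cells if no other variable has missing values.

```python
n_observations = 271116

# Missing-value counts taken from the alerts above.
missing_by_variable = {
    "Age": 9474,
    "Height": 60171,
    "Weight": 62875,
    "Medal": 231333,
}

# Share of observations missing for each variable, in percent.
missing_pct = {
    name: round(100.0 * count / n_observations, 1)
    for name, count in missing_by_variable.items()
}

# Total across the four flagged variables.
total_missing = sum(missing_by_variable.values())

print(missing_pct)
print(total_missing)
```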
Reproduction
| Analysis started | 2024-03-20 13:12:08.763264 |
|---|---|
| Analysis finished | 2024-03-20 13:12:20.654835 |
| Duration | 11.89 seconds |
| Software version | ydata-profiling v4.6.0 |
| Download configuration | config.json |
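The reported duration is the difference between the two timestamps. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed here:

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the table above.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-03-20 13:12:08.763264", TIMESTAMP_FORMAT)
finished = datetime.strptime("2024-03-20 13:12:20.654835", TIMESTAMP_FORMAT)

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()

print(f"{duration_seconds:.2f} seconds")
```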
ID
Real number (ℝ)
| Distinct | 135571 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68248.954 |
| Minimum | 1 |
| Maximum | 135571 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 7347.75 |
| Q1 | 34643 |
| median | 68205 |
| Q3 | 102097.25 |
| 95-th percentile | 128978 |
| Maximum | 135571 |
| Range | 135570 |
| Interquartile range (IQR) | 67454.25 |
Descriptive statistics
| Standard deviation | 39022.286 |
|---|---|
| Coefficient of variation (CV) | 0.57176387 |
| Kurtosis | -1.1972922 |
| Mean | 68248.954 |
| Median Absolute Deviation (MAD) | 33738 |
| Skewness | -0.0046811565 |
| Sum | 1.8503384 × 10^10 |
| Variance | 1.5227388 × 10^9 |
| Monotonicity | Increasing |
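Several of the ID statistics are derived from one another. The following sketch rechecks those relationships from the rounded figures printed above, so it compares with a tolerance rather than exactly. Variable names are illustrative.

```python
import math

n = 271116          # ID has no missing values, so every row counts
mean = 68248.954
std = 39022.286

# Coefficient of variation is the standard deviation divided by the mean.
cv = std / mean
# Variance is the square of the standard deviation.
variance = std ** 2
# Sum is the mean times the number of non-missing observations.
total = mean * n
# Range and interquartile range come from the quantile table.
value_range = 135571 - 1
iqr = 102097.25 - 34643

print(cv, variance, total, value_range, iqr)
```

Each result should agree with the reported value to within the rounding of the inputs.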
| Value | Count | Frequency (%) |
| 77710 | 58 | < 0.1% |
| 106296 | 39 | < 0.1% |
| 115354 | 38 | < 0.1% |
| 119591 | 36 | < 0.1% |
| 129196 | 32 | < 0.1% |
| 44875 | 32 | < 0.1% |
| 53240 | 32 | < 0.1% |
| 119590 | 32 | < 0.1% |
| 89187 | 32 | < 0.1% |
| 106156 | 31 | < 0.1% |
| Other values (135561) | 270754 |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 6 | |
| 6 | 8 | |
| 7 | 8 | |
| 8 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 135571 | 2 | |
| 135570 | 2 | |
| 135569 | 1 | |
| 135568 | 1 | |
| 135567 | 2 | |
| 135566 | 1 | |
| 135565 | 2 | |
| 135564 | 1 | |
| 135563 | 2 | |
| 135562 | 1 |
Name
Text
| Distinct | 134732 |
|---|---|
| Distinct (%) | 49.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 108 |
|---|---|
| Median length | 78 |
| Mean length | 19.34199 |
| Min length | 2 |
Characters and Unicode
| Total characters | 5243923 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
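The mean length reported for Name is consistent with the total character count divided by the number of observations. A short check, assuming Name has no missing values (as the overview states) and using illustrative names:

```python
import math

n_observations = 271116
total_characters = 5243923

# Average number of characters per Name value.
mean_length = total_characters / n_observations

print(round(mean_length, 5))
```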
Unique
| Unique | 77009 |
|---|---|
| Unique (%) | 28.4% |
Sample
| 1st row | A Dijiang |
|---|---|
| 2nd row | A Lamusi |
| 3rd row | Gunnar Nielsen Aaby |
| 4th row | Edgar Lindenau Aabye |
| 5th row | Christine Jacoba Aaftink |
| Value | Count | Frequency (%) |
| john | 3881 | 0.5% |
| de | 3794 | 0.5% |
| robert | 2597 | 0.4% |
| william | 2329 | 0.3% |
| james | 2027 | 0.3% |
| peter | 2007 | 0.3% |
| van | 1966 | 0.3% |
| michael | 1928 | 0.3% |
| david | 1925 | 0.3% |
| joseph | 1854 | 0.3% |
| Other values (108718) | 716928 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 491302 | 9.4% |
| (space) | 470440 | 9.0% |
| e | 428351 | 8.2% |
| i | 348275 | 6.6% |
| n | 348236 | 6.6% |
| r | 334503 | 6.4% |
| o | 298902 | 5.7% |
| l | 233064 | 4.4% |
| s | 189413 | 3.6% |
| t | 165499 | 3.2% |
| Other values (53) | 1935938 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3885189 | |
| Uppercase Letter | 751692 | 14.3% |
| Space Separator | 470440 | 9.0% |
| Other Punctuation | 60783 | 1.2% |
| Dash Punctuation | 39474 | 0.8% |
| Close Punctuation | 18174 | 0.3% |
| Open Punctuation | 18170 | 0.3% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 491302 | |
| e | 428351 | |
| i | 348275 | 9.0% |
| n | 348236 | 9.0% |
| r | 334503 | 8.6% |
| o | 298902 | 7.7% |
| l | 233064 | 6.0% |
| s | 189413 | 4.9% |
| t | 165499 | 4.3% |
| h | 130797 | 3.4% |
| Other values (16) | 916847 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 67864 | 9.0% |
| A | 59013 | 7.9% |
| S | 58822 | 7.8% |
| J | 49127 | 6.5% |
| B | 40397 | 5.4% |
| K | 38545 | 5.1% |
| C | 38204 | 5.1% |
| R | 36780 | 4.9% |
| G | 36229 | 4.8% |
| L | 36200 | 4.8% |
| Other values (16) | 290511 |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 49602 | |
| . | 6909 | 11.4% |
| , | 2801 | 4.6% |
| ' | 1449 | 2.4% |
| & | 19 | < 0.1% |
| / | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 470440 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 39474 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 18174 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 18170 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4636881 | |
| Common | 607042 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 491302 | 10.6% |
| e | 428351 | 9.2% |
| i | 348275 | 7.5% |
| n | 348236 | 7.5% |
| r | 334503 | 7.2% |
| o | 298902 | 6.4% |
| l | 233064 | 5.0% |
| s | 189413 | 4.1% |
| t | 165499 | 3.6% |
| h | 130797 | 2.8% |
| Other values (42) | 1668539 |
Common
| Value | Count | Frequency (%) |
| (space) | 470440 | |
| " | 49602 | 8.2% |
| - | 39474 | 6.5% |
| ) | 18174 | 3.0% |
| ( | 18170 | 3.0% |
| . | 6909 | 1.1% |
| , | 2801 | 0.5% |
| ' | 1449 | 0.2% |
| & | 19 | < 0.1% |
| / | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5243923 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 491302 | 9.4% |
| (space) | 470440 | 9.0% |
| e | 428351 | 8.2% |
| i | 348275 | 6.6% |
| n | 348236 | 6.6% |
| r | 334503 | 6.4% |
| o | 298902 | 5.7% |
| l | 233064 | 4.4% |
| s | 189413 | 3.6% |
| t | 165499 | 3.2% |
| Other values (53) | 1935938 |
Sex
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 271116 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| M | 196594 | |
| F | 74522 | 27.5% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 196594 | |
| f | 74522 | 27.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 196594 | |
| F | 74522 | 27.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 271116 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 196594 | |
| F | 74522 | 27.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 271116 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 196594 | |
| F | 74522 | 27.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 271116 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 196594 | |
| F | 74522 | 27.5% |
Age
Real number (ℝ)
MISSING 
| Distinct | 74 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9474 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.556898 |
| Minimum | 10 |
| Maximum | 97 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 21 |
| median | 24 |
| Q3 | 28 |
| 95-th percentile | 37 |
| Maximum | 97 |
| Range | 87 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 6.3935608 |
|---|---|
| Coefficient of variation (CV) | 0.25016967 |
| Kurtosis | 6.2706424 |
| Mean | 25.556898 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.7471225 |
| Sum | 6686758 |
| Variance | 40.87762 |
| Monotonicity | Not monotonic |
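Because Age has missing values, its statistics appear to be computed over the non-missing observations only. The sketch below tests that reading against the reported figures. The inputs are the rounded values from the tables above, so comparisons use a tolerance, and the variable names are illustrative.

```python
import math

n_observations = 271116
n_missing = 9474

# Observations that actually carry an Age value.
n_valid = n_observations - n_missing

mean = 25.556898
std = 6.3935608

# Sum over non-missing values, reconstructed from the mean.
total = mean * n_valid
variance = std ** 2
cv = std / mean

print(n_valid, round(total), variance, cv)
```

The reconstructed sum matches the reported 6686758 only when the missing rows are excluded, which supports the assumption.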
| Value | Count | Frequency (%) |
| 23 | 21875 | 8.1% |
| 24 | 21720 | 8.0% |
| 22 | 20814 | 7.7% |
| 25 | 19707 | 7.3% |
| 21 | 19164 | 7.1% |
| 26 | 17675 | 6.5% |
| 27 | 16025 | 5.9% |
| 20 | 15258 | 5.6% |
| 28 | 14043 | 5.2% |
| 19 | 11643 | 4.3% |
| Other values (64) | 83718 |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 11 | 13 | < 0.1% |
| 12 | 39 | < 0.1% |
| 13 | 187 | 0.1% |
| 14 | 837 | 0.3% |
| 15 | 2203 | 0.8% |
| 16 | 3852 | 1.4% |
| 17 | 5376 | |
| 18 | 8152 | |
| 19 | 11643 |
| Value | Count | Frequency (%) |
| 97 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 88 | 3 | < 0.1% |
| 84 | 1 | < 0.1% |
| 81 | 2 | < 0.1% |
| 80 | 3 | < 0.1% |
| 77 | 2 | < 0.1% |
| 76 | 7 | |
| 75 | 4 | < 0.1% |
| 74 | 12 |
Height
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 95 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 60171 |
| Missing (%) | 22.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 175.33897 |
| Minimum | 127 |
| Maximum | 226 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 127 |
|---|---|
| 5-th percentile | 158 |
| Q1 | 168 |
| median | 175 |
| Q3 | 183 |
| 95-th percentile | 193 |
| Maximum | 226 |
| Range | 99 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.518462 |
|---|---|
| Coefficient of variation (CV) | 0.059989301 |
| Kurtosis | 0.17772797 |
| Mean | 175.33897 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.018477298 |
| Sum | 36986879 |
| Variance | 110.63805 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 180 | 12492 | 4.6% |
| 170 | 11976 | 4.4% |
| 178 | 10708 | 3.9% |
| 175 | 10320 | 3.8% |
| 183 | 8284 | 3.1% |
| 168 | 8211 | 3.0% |
| 173 | 7843 | 2.9% |
| 172 | 7813 | 2.9% |
| 165 | 7246 | 2.7% |
| 185 | 6839 | 2.5% |
| Other values (85) | 119213 | |
| (Missing) | 60171 |
| Value | Count | Frequency (%) |
| 127 | 7 | < 0.1% |
| 128 | 1 | < 0.1% |
| 130 | 2 | < 0.1% |
| 131 | 2 | < 0.1% |
| 132 | 9 | < 0.1% |
| 133 | 6 | < 0.1% |
| 135 | 14 | |
| 136 | 28 | |
| 137 | 18 | |
| 138 | 20 |
| Value | Count | Frequency (%) |
| 226 | 3 | < 0.1% |
| 223 | 4 | < 0.1% |
| 221 | 4 | < 0.1% |
| 220 | 6 | < 0.1% |
| 219 | 2 | < 0.1% |
| 218 | 13 | |
| 217 | 11 | |
| 216 | 12 | |
| 215 | 19 | |
| 214 | 16 |
Weight
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 220 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 62875 |
| Missing (%) | 23.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.702393 |
| Minimum | 25 |
| Maximum | 214 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 25 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 60 |
| median | 70 |
| Q3 | 79 |
| 95-th percentile | 95 |
| Maximum | 214 |
| Range | 189 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 14.34802 |
|---|---|
| Coefficient of variation (CV) | 0.20293542 |
| Kurtosis | 2.0175229 |
| Mean | 70.702393 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.79716903 |
| Sum | 14723137 |
| Variance | 205.86568 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 70 | 9625 | 3.6% |
| 60 | 7994 | 2.9% |
| 75 | 7810 | 2.9% |
| 68 | 7284 | 2.7% |
| 65 | 7236 | 2.7% |
| 72 | 6252 | 2.3% |
| 80 | 6214 | 2.3% |
| 73 | 5937 | 2.2% |
| 63 | 5869 | 2.2% |
| 64 | 5764 | 2.1% |
| Other values (210) | 138256 | |
| (Missing) | 62875 |
| Value | Count | Frequency (%) |
| 25 | 6 | < 0.1% |
| 28 | 14 | < 0.1% |
| 30 | 42 | < 0.1% |
| 31 | 23 | < 0.1% |
| 32 | 41 | < 0.1% |
| 33 | 51 | < 0.1% |
| 34 | 73 | |
| 35 | 92 | |
| 36 | 137 | |
| 37 | 173 |
| Value | Count | Frequency (%) |
| 214 | 2 | < 0.1% |
| 198 | 1 | < 0.1% |
| 190 | 1 | < 0.1% |
| 182 | 2 | < 0.1% |
| 180 | 1 | < 0.1% |
| 178 | 1 | < 0.1% |
| 176.5 | 2 | < 0.1% |
| 175 | 1 | < 0.1% |
| 170 | 5 | |
| 167 | 2 | < 0.1% |
Team
Text
| Distinct | 1184 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 39 |
| Mean length | 8.4146601 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2281349 |
|---|---|
| Distinct characters | 72 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 103 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | China |
|---|---|
| 2nd row | China |
| 3rd row | Denmark |
| 4th row | Denmark/Sweden |
| 5th row | Netherlands |
| Value | Count | Frequency (%) |
| united | 19046 | 5.6% |
| states | 18179 | 5.4% |
| germany | 15068 | 4.5% |
| france | 11999 | 3.6% |
| great | 11812 | 3.5% |
| britain | 11414 | 3.4% |
| italy | 10260 | 3.0% |
| canada | 9279 | 2.7% |
| japan | 8289 | 2.5% |
| sweden | 8052 | 2.4% |
| Other values (1317) | 214526 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 324455 | |
| e | 199934 | 8.8% |
| n | 195074 | 8.6% |
| i | 170815 | 7.5% |
| t | 154682 | 6.8% |
| r | 140338 | 6.2% |
| l | 84992 | 3.7% |
| o | 80470 | 3.5% |
| s | 76328 | 3.3% |
| d | 74872 | 3.3% |
| Other values (62) | 779389 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1863640 | |
| Uppercase Letter | 337030 | 14.8% |
| Space Separator | 66808 | 2.9% |
| Decimal Number | 6417 | 0.3% |
| Dash Punctuation | 6260 | 0.3% |
| Other Punctuation | 798 | < 0.1% |
| Open Punctuation | 198 | < 0.1% |
| Close Punctuation | 198 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 324455 | |
| e | 199934 | |
| n | 195074 | |
| i | 170815 | |
| t | 154682 | |
| r | 140338 | 7.5% |
| l | 84992 | 4.6% |
| o | 80470 | 4.3% |
| s | 76328 | 4.1% |
| d | 74872 | 4.0% |
| Other values (16) | 361680 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 56302 | |
| G | 32456 | |
| C | 29629 | 8.8% |
| U | 29530 | 8.8% |
| B | 27135 | 8.1% |
| A | 20878 | 6.2% |
| F | 18207 | 5.4% |
| I | 17642 | 5.2% |
| N | 15441 | 4.6% |
| R | 13485 | 4.0% |
| Other values (16) | 76325 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2910 | |
| 2 | 2829 | |
| 3 | 413 | 6.4% |
| 4 | 107 | 1.7% |
| 9 | 35 | 0.5% |
| 7 | 34 | 0.5% |
| 6 | 31 | 0.5% |
| 5 | 23 | 0.4% |
| 8 | 18 | 0.3% |
| 0 | 17 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 227 | |
| " | 226 | |
| , | 138 | |
| . | 127 | |
| / | 43 | 5.4% |
| # | 37 | 4.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 66808 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6260 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 198 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 198 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2200670 | |
| Common | 80679 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 324455 | |
| e | 199934 | 9.1% |
| n | 195074 | 8.9% |
| i | 170815 | 7.8% |
| t | 154682 | 7.0% |
| r | 140338 | 6.4% |
| l | 84992 | 3.9% |
| o | 80470 | 3.7% |
| s | 76328 | 3.5% |
| d | 74872 | 3.4% |
| Other values (42) | 698710 |
Common
| Value | Count | Frequency (%) |
| (space) | 66808 | |
| - | 6260 | 7.8% |
| 1 | 2910 | 3.6% |
| 2 | 2829 | 3.5% |
| 3 | 413 | 0.5% |
| ' | 227 | 0.3% |
| " | 226 | 0.3% |
| ( | 198 | 0.2% |
| ) | 198 | 0.2% |
| , | 138 | 0.2% |
| Other values (10) | 472 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2281349 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 324455 | |
| e | 199934 | 8.8% |
| n | 195074 | 8.6% |
| i | 170815 | 7.5% |
| t | 154682 | 6.8% |
| r | 140338 | 6.2% |
| l | 84992 | 3.7% |
| o | 80470 | 3.5% |
| s | 76328 | 3.3% |
| d | 74872 | 3.3% |
| Other values (62) | 779389 |
NOC
Text
| Distinct | 230 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 813348 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CHN |
|---|---|
| 2nd row | CHN |
| 3rd row | DEN |
| 4th row | DEN |
| 5th row | NED |
| Value | Count | Frequency (%) |
| usa | 18853 | 7.0% |
| fra | 12758 | 4.7% |
| gbr | 12256 | 4.5% |
| ita | 10715 | 4.0% |
| ger | 9830 | 3.6% |
| can | 9733 | 3.6% |
| jpn | 8444 | 3.1% |
| swe | 8339 | 3.1% |
| aus | 7638 | 2.8% |
| hun | 6607 | 2.4% |
| Other values (220) | 165943 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 96188 | |
| A | 87196 | 10.7% |
| U | 79908 | 9.8% |
| S | 67289 | 8.3% |
| N | 60744 | 7.5% |
| E | 54771 | 6.7% |
| G | 44903 | 5.5% |
| I | 33230 | 4.1% |
| B | 31123 | 3.8% |
| C | 29054 | 3.6% |
| Other values (16) | 228942 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 813348 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 96188 | |
| A | 87196 | 10.7% |
| U | 79908 | 9.8% |
| S | 67289 | 8.3% |
| N | 60744 | 7.5% |
| E | 54771 | 6.7% |
| G | 44903 | 5.5% |
| I | 33230 | 4.1% |
| B | 31123 | 3.8% |
| C | 29054 | 3.6% |
| Other values (16) | 228942 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 813348 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 96188 | |
| A | 87196 | 10.7% |
| U | 79908 | 9.8% |
| S | 67289 | 8.3% |
| N | 60744 | 7.5% |
| E | 54771 | 6.7% |
| G | 44903 | 5.5% |
| I | 33230 | 4.1% |
| B | 31123 | 3.8% |
| C | 29054 | 3.6% |
| Other values (16) | 228942 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 813348 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 96188 | |
| A | 87196 | 10.7% |
| U | 79908 | 9.8% |
| S | 67289 | 8.3% |
| N | 60744 | 7.5% |
| E | 54771 | 6.7% |
| G | 44903 | 5.5% |
| I | 33230 | 4.1% |
| B | 31123 | 3.8% |
| C | 29054 | 3.6% |
| Other values (16) | 228942 |
Games
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Characters and Unicode
| Total characters | 2982276 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1992 Summer |
|---|---|
| 2nd row | 2012 Summer |
| 3rd row | 1920 Summer |
| 4th row | 1900 Summer |
| 5th row | 1988 Winter |
| Value | Count | Frequency (%) |
| summer | 222552 | |
| winter | 48564 | 9.0% |
| 1992 | 16413 | 3.0% |
| 1988 | 14676 | 2.7% |
| 2000 | 13821 | 2.5% |
| 1996 | 13780 | 2.5% |
| 2016 | 13688 | 2.5% |
| 2008 | 13602 | 2.5% |
| 2004 | 13443 | 2.5% |
| 2012 | 12920 | 2.4% |
| Other values (27) | 158773 |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 445104 | |
| (space) | 271116 | |
| e | 271116 | |
| r | 271116 | |
| 1 | 225799 | |
| 9 | 222816 | |
| S | 222552 | |
| u | 222552 | |
| 0 | 185309 | 6.2% |
| 2 | 162937 | 5.5% |
| Other values (10) | 481859 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1355580 | |
| Decimal Number | 1084464 | |
| Space Separator | 271116 | 9.1% |
| Uppercase Letter | 271116 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 225799 | |
| 9 | 222816 | |
| 0 | 185309 | |
| 2 | 162937 | |
| 8 | 94098 | |
| 6 | 87494 | 8.1% |
| 4 | 57036 | 5.3% |
| 7 | 22461 | 2.1% |
| 5 | 15792 | 1.5% |
| 3 | 10722 | 1.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 445104 | |
| e | 271116 | |
| r | 271116 | |
| u | 222552 | |
| i | 48564 | 3.6% |
| n | 48564 | 3.6% |
| t | 48564 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 222552 | |
| W | 48564 | 17.9% |
Space Separator
| Value | Count | Frequency (%) |
| 271116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1626696 | |
| Common | 1355580 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 271116 | |
| 1 | 225799 | |
| 9 | 222816 | |
| 0 | 185309 | |
| 2 | 162937 | |
| 8 | 94098 | 6.9% |
| 6 | 87494 | 6.5% |
| 4 | 57036 | 4.2% |
| 7 | 22461 | 1.7% |
| 5 | 15792 | 1.2% |
Latin
| Value | Count | Frequency (%) |
| m | 445104 | |
| e | 271116 | |
| r | 271116 | |
| S | 222552 | |
| u | 222552 | |
| W | 48564 | 3.0% |
| i | 48564 | 3.0% |
| n | 48564 | 3.0% |
| t | 48564 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2982276 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| m | 445104 | |
| (space) | 271116 | |
| e | 271116 | |
| r | 271116 | |
| 1 | 225799 | |
| 9 | 222816 | |
| S | 222552 | |
| u | 222552 | |
| 0 | 185309 | 6.2% |
| 2 | 162937 | 5.5% |
| Other values (10) | 481859 |
Year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1978.3785 |
| Minimum | 1896 |
| Maximum | 2016 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1896 |
|---|---|
| 5-th percentile | 1920 |
| Q1 | 1960 |
| median | 1988 |
| Q3 | 2002 |
| 95-th percentile | 2016 |
| Maximum | 2016 |
| Range | 120 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 29.877632 |
|---|---|
| Coefficient of variation (CV) | 0.015102081 |
| Kurtosis | -0.20694758 |
| Mean | 1978.3785 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.81773578 |
| Sum | 5.3637006 × 10^8 |
| Variance | 892.67289 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1992 | 16413 | 6.1% |
| 1988 | 14676 | 5.4% |
| 2000 | 13821 | 5.1% |
| 1996 | 13780 | 5.1% |
| 2016 | 13688 | 5.0% |
| 2008 | 13602 | 5.0% |
| 2004 | 13443 | 5.0% |
| 2012 | 12920 | 4.8% |
| 1972 | 11959 | 4.4% |
| 1984 | 11588 | 4.3% |
| Other values (25) | 135226 |
| Value | Count | Frequency (%) |
| 1896 | 380 | 0.1% |
| 1900 | 1936 | 0.7% |
| 1904 | 1301 | 0.5% |
| 1906 | 1733 | 0.6% |
| 1908 | 3101 | |
| 1912 | 4040 | |
| 1920 | 4292 | |
| 1924 | 5693 | |
| 1928 | 5574 | |
| 1932 | 3321 |
| Value | Count | Frequency (%) |
| 2016 | 13688 | |
| 2014 | 4891 | 1.8% |
| 2012 | 12920 | |
| 2010 | 4402 | 1.6% |
| 2008 | 13602 | |
| 2006 | 4382 | 1.6% |
| 2004 | 13443 | |
| 2002 | 4109 | 1.5% |
| 2000 | 13821 | |
| 1998 | 3605 | 1.3% |
Season
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1626696 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Summer |
|---|---|
| 2nd row | Summer |
| 3rd row | Summer |
| 4th row | Summer |
| 5th row | Winter |
Common Values
| Value | Count | Frequency (%) |
| Summer | 222552 | |
| Winter | 48564 | 17.9% |
Common Values (Plot)
| Value | Count | Frequency (%) |
| summer | 222552 | |
| winter | 48564 | 17.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 445104 | |
| e | 271116 | |
| r | 271116 | |
| S | 222552 | |
| u | 222552 | |
| W | 48564 | 3.0% |
| i | 48564 | 3.0% |
| n | 48564 | 3.0% |
| t | 48564 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1355580 | |
| Uppercase Letter | 271116 | 16.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 445104 | |
| e | 271116 | |
| r | 271116 | |
| u | 222552 | |
| i | 48564 | 3.6% |
| n | 48564 | 3.6% |
| t | 48564 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 222552 | |
| W | 48564 | 17.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1626696 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 445104 | |
| e | 271116 | |
| r | 271116 | |
| S | 222552 | |
| u | 222552 | |
| W | 48564 | 3.0% |
| i | 48564 | 3.0% |
| n | 48564 | 3.0% |
| t | 48564 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1626696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| m | 445104 | |
| e | 271116 | |
| r | 271116 | |
| S | 222552 | |
| u | 222552 | |
| W | 48564 | 3.0% |
| i | 48564 | 3.0% |
| n | 48564 | 3.0% |
| t | 48564 | 3.0% |
City
Categorical
HIGH CORRELATION 
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 14 |
| Mean length | 7.7807912 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2109497 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Barcelona |
|---|---|
| 2nd row | London |
| 3rd row | Antwerpen |
| 4th row | Paris |
| 5th row | Calgary |
Common Values
| Value | Count | Frequency (%) |
| London | 22426 | 8.3% |
| Athina | 15556 | 5.7% |
| Sydney | 13821 | 5.1% |
| Atlanta | 13780 | 5.1% |
| Rio de Janeiro | 13688 | 5.0% |
| Beijing | 13602 | 5.0% |
| Barcelona | 12977 | 4.8% |
| Los Angeles | 12423 | 4.6% |
| Seoul | 12037 | 4.4% |
| Munich | 10304 | 3.8% |
| Other values (32) | 130502 |
| Value | Count | Frequency (%) |
| london | 22426 | 6.7% |
| athina | 15556 | 4.6% |
| sydney | 13821 | 4.1% |
| atlanta | 13780 | 4.1% |
| rio | 13688 | 4.1% |
| de | 13688 | 4.1% |
| janeiro | 13688 | 4.1% |
| beijing | 13602 | 4.1% |
| barcelona | 12977 | 3.9% |
| city | 12697 | 3.8% |
| Other values (41) | 189277 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 214605 | 10.2% |
| o | 207230 | 9.8% |
| e | 193828 | 9.2% |
| a | 164703 | 7.8% |
| i | 156422 | 7.4% |
| l | 114486 | 5.4% |
| r | 96081 | 4.6% |
| t | 92438 | 4.4% |
| (space) | 64084 | 3.0% |
| s | 59391 | 2.8% |
| Other values (35) | 746229 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1719503 | |
| Uppercase Letter | 322407 | 15.3% |
| Space Separator | 64084 | 3.0% |
| Other Punctuation | 2608 | 0.1% |
| Dash Punctuation | 895 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 214605 | |
| o | 207230 | |
| e | 193828 | |
| a | 164703 | |
| i | 156422 | |
| l | 114486 | 6.7% |
| r | 96081 | 5.6% |
| t | 92438 | 5.4% |
| s | 59391 | 3.5% |
| d | 58332 | 3.4% |
| Other values (15) | 361987 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 55786 | |
| S | 47059 | |
| L | 45517 | |
| M | 41210 | |
| B | 33085 | |
| R | 21807 | 6.8% |
| C | 17103 | 5.3% |
| J | 13688 | 4.2% |
| T | 12084 | 3.7% |
| P | 10162 | 3.2% |
| Other values (6) | 24906 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1307 | |
| . | 1301 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 64084 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 895 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2041910 | |
| Common | 67587 | 3.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 214605 | 10.5% |
| o | 207230 | 10.1% |
| e | 193828 | 9.5% |
| a | 164703 | 8.1% |
| i | 156422 | 7.7% |
| l | 114486 | 5.6% |
| r | 96081 | 4.7% |
| t | 92438 | 4.5% |
| s | 59391 | 2.9% |
| d | 58332 | 2.9% |
| Other values (31) | 684394 |
Common
| Value | Count | Frequency (%) |
| (space) | 64084 | |
| ' | 1307 | 1.9% |
| . | 1301 | 1.9% |
| - | 895 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2109497 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 214605 | 10.2% |
| o | 207230 | 9.8% |
| e | 193828 | 9.2% |
| a | 164703 | 7.8% |
| i | 156422 | 7.4% |
| l | 114486 | 5.4% |
| r | 96081 | 4.6% |
| t | 92438 | 4.4% |
| (space) | 64084 | 3.0% |
| s | 59391 | 2.8% |
| Other values (35) | 746229 |
Sport
Text
| Distinct | 66 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 20 |
| Mean length | 9.5066577 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2577407 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Basketball |
|---|---|
| 2nd row | Judo |
| 3rd row | Football |
| 4th row | Tug-Of-War |
| 5th row | Speed Skating |
| Value | Count | Frequency (%) |
| athletics | 38624 | 11.5% |
| gymnastics | 27365 | 8.2% |
| swimming | 24104 | 7.2% |
| skiing | 18899 | 5.7% |
| shooting | 11448 | 3.4% |
| hockey | 10933 | 3.3% |
| cycling | 10859 | 3.2% |
| fencing | 10735 | 3.2% |
| rowing | 10595 | 3.2% |
| skating | 9445 | 2.8% |
| Other values (68) | 161473 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 311765 | 12.1% |
| n | 236992 | 9.2% |
| t | 196051 | 7.6% |
| e | 156041 | 6.1% |
| s | 149741 | 5.8% |
| g | 144194 | 5.6% |
| l | 142623 | 5.5% |
| o | 133948 | 5.2% |
| c | 111472 | 4.3% |
| a | 103764 | 4.0% |
| Other values (37) | 890816 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2178883 | |
| Uppercase Letter | 334820 | 13.0% |
| Space Separator | 63364 | 2.5% |
| Dash Punctuation | 340 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 311765 | |
| n | 236992 | |
| t | 196051 | 9.0% |
| e | 156041 | 7.2% |
| s | 149741 | 6.9% |
| g | 144194 | 6.6% |
| l | 142623 | 6.5% |
| o | 133948 | 6.1% |
| c | 111472 | 5.1% |
| a | 103764 | 4.8% |
| Other values (15) | 492292 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 84409 | |
| A | 53391 | |
| C | 40724 | |
| G | 27612 | 8.2% |
| B | 21451 | 6.4% |
| F | 20715 | 6.2% |
| W | 15107 | 4.5% |
| H | 14598 | 4.4% |
| R | 11730 | 3.5% |
| T | 9763 | 2.9% |
| Other values (10) | 35320 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 63364 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 340 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2513703 | |
| Common | 63704 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 311765 | 12.4% |
| n | 236992 | 9.4% |
| t | 196051 | 7.8% |
| e | 156041 | 6.2% |
| s | 149741 | 6.0% |
| g | 144194 | 5.7% |
| l | 142623 | 5.7% |
| o | 133948 | 5.3% |
| c | 111472 | 4.4% |
| a | 103764 | 4.1% |
| Other values (35) | 827112 | 32.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 63364 | 99.5% |
| - | 340 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2577407 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 311765 | 12.1% |
| n | 236992 | 9.2% |
| t | 196051 | 7.6% |
| e | 156041 | 6.1% |
| s | 149741 | 5.8% |
| g | 144194 | 5.6% |
| l | 142623 | 5.5% |
| o | 133948 | 5.2% |
| c | 111472 | 4.3% |
| a | 103764 | 4.0% |
| Other values (37) | 890816 | 34.6% |
Event
Text
| Distinct | 765 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 85 |
|---|---|
| Median length | 58 |
| Mean length | 32.063335 |
| Min length | 15 |
Characters and Unicode
| Total characters | 8692883 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Basketball Men's Basketball |
|---|---|
| 2nd row | Judo Men's Extra-Lightweight |
| 3rd row | Football Men's Football |
| 4th row | Tug-Of-War Men's Tug-Of-War |
| 5th row | Speed Skating Women's 500 metres |
| Value | Count | Frequency (%) |
| men's | 182260 | 15.0% |
| women's | 71916 | 5.9% |
| metres | 70024 | 5.7% |
| athletics | 38624 | 3.2% |
| gymnastics | 27365 | 2.2% |
| individual | 25476 | 2.1% |
| swimming | 24136 | 2.0% |
| hockey | 21866 | 1.8% |
| team | 20722 | 1.7% |
| skiing | 18899 | 1.6% |
| Other values (428) | 717273 | 58.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 947445 | 10.9% |
| e | 917589 | 10.6% |
| n | 614723 | 7.1% |
| s | 597800 | 6.9% |
| i | 522595 | 6.0% |
| t | 416689 | 4.8% |
| l | 413207 | 4.8% |
| o | 383702 | 4.4% |
| a | 323320 | 3.7% |
| m | 306751 | 3.5% |
| Other values (59) | 3249062 | 37.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6076066 | 69.9% |
| Uppercase Letter | 1033959 | 11.9% |
| Space Separator | 947445 | 10.9% |
| Other Punctuation | 334267 | 3.8% |
| Decimal Number | 273212 | 3.1% |
| Dash Punctuation | 26342 | 0.3% |
| Open Punctuation | 794 | < 0.1% |
| Close Punctuation | 794 | < 0.1% |
| Math Symbol | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 917589 | 15.1% |
| n | 614723 | 10.1% |
| s | 597800 | 9.8% |
| i | 522595 | 8.6% |
| t | 416689 | 6.9% |
| l | 413207 | 6.8% |
| o | 383702 | 6.3% |
| a | 323320 | 5.3% |
| m | 306751 | 5.0% |
| r | 286237 | 4.7% |
| Other values (15) | 1293453 | 21.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 215454 | 20.8% |
| S | 122248 | 11.8% |
| W | 96251 | 9.3% |
| A | 74247 | 7.2% |
| F | 64464 | 6.2% |
| C | 53817 | 5.2% |
| H | 52982 | 5.1% |
| T | 50659 | 4.9% |
| R | 49417 | 4.8% |
| B | 48549 | 4.7% |
| Other values (15) | 205871 | 19.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 155854 | 57.0% |
| 1 | 39994 | 14.6% |
| 5 | 25528 | 9.3% |
| 4 | 25157 | 9.2% |
| 2 | 14790 | 5.4% |
| 3 | 5109 | 1.9% |
| 8 | 3429 | 1.3% |
| 7 | 1944 | 0.7% |
| 6 | 1332 | 0.5% |
| 9 | 75 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 254702 | 76.2% |
| , | 76004 | 22.7% |
| . | 2365 | 0.7% |
| / | 1196 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 947445 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26342 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 794 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 794 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7110025 | 81.8% |
| Common | 1582858 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 917589 | 12.9% |
| n | 614723 | 8.6% |
| s | 597800 | 8.4% |
| i | 522595 | 7.4% |
| t | 416689 | 5.9% |
| l | 413207 | 5.8% |
| o | 383702 | 5.4% |
| a | 323320 | 4.5% |
| m | 306751 | 4.3% |
| r | 286237 | 4.0% |
| Other values (40) | 2327412 | 32.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 947445 | 59.9% |
| ' | 254702 | 16.1% |
| 0 | 155854 | 9.8% |
| , | 76004 | 4.8% |
| 1 | 39994 | 2.5% |
| - | 26342 | 1.7% |
| 5 | 25528 | 1.6% |
| 4 | 25157 | 1.6% |
| 2 | 14790 | 0.9% |
| 3 | 5109 | 0.3% |
| Other values (9) | 11933 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8692883 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 947445 | 10.9% |
| e | 917589 | 10.6% |
| n | 614723 | 7.1% |
| s | 597800 | 6.9% |
| i | 522595 | 6.0% |
| t | 416689 | 4.8% |
| l | 413207 | 4.8% |
| o | 383702 | 4.4% |
| a | 323320 | 3.7% |
| m | 306751 | 3.5% |
| Other values (59) | 3249062 | 37.4% |
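The per-category character counts reported for Event can be approximated with Python's standard `unicodedata` module. The snippet below is a minimal sketch, not the profiler's own implementation: it tallies Unicode general categories (`Ll` = Lowercase Letter, `Lu` = Uppercase Letter, `Zs` = Space Separator, `Po` = Other Punctuation, `Pd` = Dash Punctuation, `Nd` = Decimal Number) over the five sample Event values only. The helper name `category_counts` is an assumption made for illustration.

```python
import unicodedata
from collections import Counter

# The five Event values listed in the "Sample" table of this section.
events = [
    "Basketball Men's Basketball",
    "Judo Men's Extra-Lightweight",
    "Football Men's Football",
    "Tug-Of-War Men's Tug-Of-War",
    "Speed Skating Women's 500 metres",
]


def category_counts(values):
    """Count characters by Unicode general category code (e.g. 'Ll', 'Zs')."""
    counts = Counter()
    for value in values:
        # unicodedata.category returns a two-letter code for each character.
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


counts = category_counts(events)
for code, n in counts.most_common():
    print(code, n)
```

Run over the full column rather than five rows, the same tally would be expected to line up with the "Most occurring categories" table, assuming the report classifies characters by Unicode general category.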
Medal
Categorical
MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 231333 |
| Missing (%) | 85.3% |
| Memory size | 2.1 MiB |
| Gold | 13372 |
|---|---|
| Bronze | 13295 |
| Silver | 13116 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.3277531 |
| Min length | 4 |
Characters and Unicode
| Total characters | 211954 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Gold |
|---|---|
| 2nd row | Bronze |
| 3rd row | Bronze |
| 4th row | Bronze |
| 5th row | Bronze |
Common Values
| Value | Count | Frequency (%) |
| Gold | 13372 | 4.9% |
| Bronze | 13295 | 4.9% |
| Silver | 13116 | 4.8% |
| (Missing) | 231333 | 85.3% |
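As a quick consistency check, the Frequency (%) column can be recomputed from the counts in the Common Values table. This is a minimal sketch using only the figures shown above; the variable names are illustrative.

```python
# Counts copied from the Medal "Common Values" table.
medal_counts = {
    "Gold": 13372,
    "Bronze": 13295,
    "Silver": 13116,
    "(Missing)": 231333,
}

# The counts should add up to the number of observations in the dataset.
total = sum(medal_counts.values())

# Share of each value as a percentage of all rows, rounded to one decimal.
shares = {value: round(100 * n / total, 1) for value, n in medal_counts.items()}

for value, pct in shares.items():
    print(f"{value}: {pct}%")
```

The total equals the 271116 observations reported in the dataset statistics, and the missing share matches the 85.3% shown for Medal.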
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| gold | 13372 | 33.6% |
| bronze | 13295 | 33.4% |
| silver | 13116 | 33.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 26667 | 12.6% |
| l | 26488 | 12.5% |
| r | 26411 | 12.5% |
| e | 26411 | 12.5% |
| G | 13372 | 6.3% |
| d | 13372 | 6.3% |
| B | 13295 | 6.3% |
| n | 13295 | 6.3% |
| z | 13295 | 6.3% |
| S | 13116 | 6.2% |
| Other values (2) | 26232 | 12.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 172171 | 81.2% |
| Uppercase Letter | 39783 | 18.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 26667 | 15.5% |
| l | 26488 | 15.4% |
| r | 26411 | 15.3% |
| e | 26411 | 15.3% |
| d | 13372 | 7.8% |
| n | 13295 | 7.7% |
| z | 13295 | 7.7% |
| i | 13116 | 7.6% |
| v | 13116 | 7.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 13372 | 33.6% |
| B | 13295 | 33.4% |
| S | 13116 | 33.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 211954 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 26667 | 12.6% |
| l | 26488 | 12.5% |
| r | 26411 | 12.5% |
| e | 26411 | 12.5% |
| G | 13372 | 6.3% |
| d | 13372 | 6.3% |
| B | 13295 | 6.3% |
| n | 13295 | 6.3% |
| z | 13295 | 6.3% |
| S | 13116 | 6.2% |
| Other values (2) | 26232 | 12.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 211954 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 26667 | 12.6% |
| l | 26488 | 12.5% |
| r | 26411 | 12.5% |
| e | 26411 | 12.5% |
| G | 13372 | 6.3% |
| d | 13372 | 6.3% |
| B | 13295 | 6.3% |
| n | 13295 | 6.3% |
| z | 13295 | 6.3% |
| S | 13116 | 6.2% |
| Other values (2) | 26232 | 12.4% |
|  | ID | Age | Height | Weight | Year | Sex | Season | City | Medal |
|---|---|---|---|---|---|---|---|---|---|
| ID | 1.000 | -0.002 | -0.011 | -0.012 | 0.013 | 0.029 | 0.038 | 0.024 | 0.000 |
| Age | -0.002 | 1.000 | 0.145 | 0.217 | 0.001 | 0.251 | 0.084 | 0.090 | 0.000 |
| Height | -0.011 | 0.145 | 1.000 | 0.827 | 0.050 | 0.489 | 0.117 | 0.060 | 0.019 |
| Weight | -0.012 | 0.217 | 0.827 | 1.000 | 0.009 | 0.537 | 0.070 | 0.050 | 0.017 |
| Year | 0.013 | 0.001 | 0.050 | 0.009 | 1.000 | 0.292 | 0.162 | 0.912 | 0.015 |
| Sex | 0.029 | 0.251 | 0.489 | 0.537 | 0.292 | 1.000 | 0.037 | 0.257 | 0.000 |
| Season | 0.038 | 0.084 | 0.117 | 0.070 | 0.162 | 0.037 | 1.000 | 1.000 | 0.000 |
| City | 0.024 | 0.090 | 0.060 | 0.050 | 0.912 | 0.257 | 1.000 | 1.000 | 0.000 |
| Medal | 0.000 | 0.000 | 0.019 | 0.017 | 0.015 | 0.000 | 0.000 | 0.000 | 1.000 |
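The report does not name the association measure behind each cell of this matrix. Assuming the categorical pairs (such as Season and City) are scored with a Cramér's V-style statistic, the sketch below shows why a pair can reach 1.000: when every city maps to exactly one season, the association is perfect. The function is the plain, uncorrected Cramér's V written from its textbook definition, and the toy pairs are hypothetical, not rows from the dataset.

```python
import math
from collections import Counter


def cramers_v(pairs):
    """Plain (uncorrected) Cramér's V for a list of (x, y) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(x for x, _ in pairs)
    col_totals = Counter(y for _, y in pairs)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by n * (min(rows, cols) - 1); undefined for a single category.
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Hypothetical toy data: each city appears with exactly one season.
season_city = (
    [("Summer", "Barcelona")] * 3
    + [("Summer", "London")] * 2
    + [("Winter", "Calgary")] * 2
    + [("Winter", "Sochi")]
)

# Hypothetical toy data with no association between the two variables.
independent = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]

print(cramers_v(season_city))
print(cramers_v(independent))
```

The first call should come out at approximately 1.0 and the second at 0.0. The profiler may apply a bias correction or a different measure for mixed numeric and categorical pairs, so these values are illustrative only.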
|  | ID | Name | Sex | Age | Height | Weight | Team | NOC | Games | Year | Season | City | Sport | Event | Medal |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | A Dijiang | M | 24.0 | 180.0 | 80.0 | China | CHN | 1992 Summer | 1992 | Summer | Barcelona | Basketball | Basketball Men's Basketball | NaN |
| 1 | 2 | A Lamusi | M | 23.0 | 170.0 | 60.0 | China | CHN | 2012 Summer | 2012 | Summer | London | Judo | Judo Men's Extra-Lightweight | NaN |
| 2 | 3 | Gunnar Nielsen Aaby | M | 24.0 | NaN | NaN | Denmark | DEN | 1920 Summer | 1920 | Summer | Antwerpen | Football | Football Men's Football | NaN |
| 3 | 4 | Edgar Lindenau Aabye | M | 34.0 | NaN | NaN | Denmark/Sweden | DEN | 1900 Summer | 1900 | Summer | Paris | Tug-Of-War | Tug-Of-War Men's Tug-Of-War | Gold |
| 4 | 5 | Christine Jacoba Aaftink | F | 21.0 | 185.0 | 82.0 | Netherlands | NED | 1988 Winter | 1988 | Winter | Calgary | Speed Skating | Speed Skating Women's 500 metres | NaN |
| 5 | 5 | Christine Jacoba Aaftink | F | 21.0 | 185.0 | 82.0 | Netherlands | NED | 1988 Winter | 1988 | Winter | Calgary | Speed Skating | Speed Skating Women's 1,000 metres | NaN |
| 6 | 5 | Christine Jacoba Aaftink | F | 25.0 | 185.0 | 82.0 | Netherlands | NED | 1992 Winter | 1992 | Winter | Albertville | Speed Skating | Speed Skating Women's 500 metres | NaN |
| 7 | 5 | Christine Jacoba Aaftink | F | 25.0 | 185.0 | 82.0 | Netherlands | NED | 1992 Winter | 1992 | Winter | Albertville | Speed Skating | Speed Skating Women's 1,000 metres | NaN |
| 8 | 5 | Christine Jacoba Aaftink | F | 27.0 | 185.0 | 82.0 | Netherlands | NED | 1994 Winter | 1994 | Winter | Lillehammer | Speed Skating | Speed Skating Women's 500 metres | NaN |
| 9 | 5 | Christine Jacoba Aaftink | F | 27.0 | 185.0 | 82.0 | Netherlands | NED | 1994 Winter | 1994 | Winter | Lillehammer | Speed Skating | Speed Skating Women's 1,000 metres | NaN |
|  | ID | Name | Sex | Age | Height | Weight | Team | NOC | Games | Year | Season | City | Sport | Event | Medal |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 271106 | 135565 | Fernando scar Zylberberg | M | 27.0 | 168.0 | 76.0 | Argentina | ARG | 2004 Summer | 2004 | Summer | Athina | Hockey | Hockey Men's Hockey | NaN |
| 271107 | 135566 | James Francis "Jim" Zylker | M | 21.0 | 175.0 | 75.0 | United States | USA | 1972 Summer | 1972 | Summer | Munich | Football | Football Men's Football | NaN |
| 271108 | 135567 | Aleksandr Viktorovich Zyuzin | M | 24.0 | 183.0 | 72.0 | Russia | RUS | 2000 Summer | 2000 | Summer | Sydney | Rowing | Rowing Men's Lightweight Coxless Fours | NaN |
| 271109 | 135567 | Aleksandr Viktorovich Zyuzin | M | 28.0 | 183.0 | 72.0 | Russia | RUS | 2004 Summer | 2004 | Summer | Athina | Rowing | Rowing Men's Lightweight Coxless Fours | NaN |
| 271110 | 135568 | Olga Igorevna Zyuzkova | F | 33.0 | 171.0 | 69.0 | Belarus | BLR | 2016 Summer | 2016 | Summer | Rio de Janeiro | Basketball | Basketball Women's Basketball | NaN |
| 271111 | 135569 | Andrzej ya | M | 29.0 | 179.0 | 89.0 | Poland-1 | POL | 1976 Winter | 1976 | Winter | Innsbruck | Luge | Luge Mixed (Men)'s Doubles | NaN |
| 271112 | 135570 | Piotr ya | M | 27.0 | 176.0 | 59.0 | Poland | POL | 2014 Winter | 2014 | Winter | Sochi | Ski Jumping | Ski Jumping Men's Large Hill, Individual | NaN |
| 271113 | 135570 | Piotr ya | M | 27.0 | 176.0 | 59.0 | Poland | POL | 2014 Winter | 2014 | Winter | Sochi | Ski Jumping | Ski Jumping Men's Large Hill, Team | NaN |
| 271114 | 135571 | Tomasz Ireneusz ya | M | 30.0 | 185.0 | 96.0 | Poland | POL | 1998 Winter | 1998 | Winter | Nagano | Bobsleigh | Bobsleigh Men's Four | NaN |
| 271115 | 135571 | Tomasz Ireneusz ya | M | 34.0 | 185.0 | 96.0 | Poland | POL | 2002 Winter | 2002 | Winter | Salt Lake City | Bobsleigh | Bobsleigh Men's Four | NaN |
Most frequently occurring
|  | ID | Name | Sex | Age | Height | Weight | Team | NOC | Games | Year | Season | City | Sport | Event | Medal | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 379 | 77710 | Robert Tait McKenzie | M | 65.0 | NaN | NaN | Canada | CAN | 1932 Summer | 1932 | Summer | Los Angeles | Art Competitions | Art Competitions Mixed Sculpturing, Unknown Event | NaN | 43 |
| 404 | 83312 | Alfred James Munnings | M | 69.0 | NaN | NaN | Great Britain | GBR | 1948 Summer | 1948 | Summer | London | Art Competitions | Art Competitions Mixed Painting, Unknown Event | NaN | 25 |
| 49 | 12380 | Acee Blue Eagle | M | 24.0 | NaN | NaN | United States | USA | 1932 Summer | 1932 | Summer | Los Angeles | Art Competitions | Art Competitions Mixed Painting, Unknown Event | NaN | 17 |
| 363 | 74532 | Miltiades Manno | M | 53.0 | NaN | 76.0 | Hungary | HUN | 1932 Summer | 1932 | Summer | Los Angeles | Art Competitions | Art Competitions Mixed Painting, Unknown Event | NaN | 17 |
| 415 | 86677 | Stanisaw Noakowski | M | 61.0 | NaN | NaN | Poland | POL | 1928 Summer | 1928 | Summer | Amsterdam | Art Competitions | Art Competitions Mixed Painting, Drawings And Water Colors | NaN | 17 |
| 114 | 28407 | Wilhelm (William) Hunt Diederich | M | 48.0 | NaN | NaN | United States | USA | 1932 Summer | 1932 | Summer | Los Angeles | Art Competitions | Art Competitions Mixed Painting, Unknown Event | NaN | 16 |
| 607 | 134046 | ngel Zrraga Argelles | M | 41.0 | NaN | NaN | Mexico | MEX | 1928 Summer | 1928 | Summer | Amsterdam | Art Competitions | Art Competitions Mixed Painting, Paintings | NaN | 16 |
| 212 | 44875 | Alfrd (Arnold-) Hajs (Guttmann-) | M | 50.0 | NaN | NaN | Hungary | HUN | 1928 Summer | 1928 | Summer | Amsterdam | Art Competitions | Art Competitions Mixed Architecture, Architectural Designs | NaN | 14 |
| 213 | 44875 | Alfrd (Arnold-) Hajs (Guttmann-) | M | 50.0 | NaN | NaN | Hungary | HUN | 1928 Summer | 1928 | Summer | Amsterdam | Art Competitions | Art Competitions Mixed Architecture, Designs For Town Planning | NaN | 14 |
| 61 | 14083 | Marcel Bouraine | M | NaN | NaN | NaN | France | FRA | 1924 Summer | 1924 | Summer | Paris | Art Competitions | Art Competitions Mixed Sculpturing | NaN | 13 |
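A table like this one can be approximated by counting how often each full record occurs and keeping the records seen more than once. The sketch below uses only the standard library. The toy rows carry a subset of the columns and their repeat counts are hypothetical, not the "# duplicates" values above, and the helper name `most_frequent_duplicates` is an assumption made for illustration.

```python
from collections import Counter

# Hypothetical toy records (ID, Name, Games, Event); repeat counts are illustrative.
mckenzie = (77710, "Robert Tait McKenzie", "1932 Summer",
            "Art Competitions Mixed Sculpturing, Unknown Event")
munnings = (83312, "Alfred James Munnings", "1948 Summer",
            "Art Competitions Mixed Painting, Unknown Event")
dijiang = (1, "A Dijiang", "1992 Summer", "Basketball Men's Basketball")

rows = [mckenzie, mckenzie, mckenzie, munnings, munnings, dijiang]


def most_frequent_duplicates(records, top=10):
    """Return up to `top` (record, occurrences) pairs for records seen more than once."""
    counts = Counter(records)
    return [(record, n) for record, n in counts.most_common(top) if n > 1]


dupes = most_frequent_duplicates(rows)
for record, n in dupes:
    print(n, record[1], "|", record[3])
```

Records must be hashable for `Counter`, which is why each row is a tuple. With a real DataFrame, missing values would need consistent handling before counting, since NaN does not compare equal to itself.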